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(57) Abstract: The present invention provides novel engineered derivatives of green fluorescent protein (GFP) which have an amino 
acid sequence which is modified by amino acid substitution compared with the amino acid sequence of wild type Green Fluorescent 
Protein. The modified GFPs exhibit enhanced fluorescence relative to wtGFP when expressed in non-homologous cells at tempera- 
tures above 30 °C, and when excited at about 490 nm compared to the parent proteins, i.e. wtGFP. An example of a preferred protein 
is F64L-S175G-E222G-GFP. The modified GFPs provide a means for detecting GFP reporters in mammalian cells at lower levels 
of expression and/or increased sensitivity relative to wtGFP. This greatly improves the usefulness of fluorescent proteins in studying 
cellular functions in living cells. 
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MUTANTS OF GREEN FLUORESCENT PROTEIN 

The present invention relates to novel variants of the fluorescent 
protein GFP having improved fluorescence properties. 

The use of Green Fluorescent Protein (GFP) derived from Aequorea 
victoria has revolutionised research into many cellular and molecular- 
biological processes. GFP allows researchers to label proteins within cells 
with an intrinsic fluor, so obviating the requirement to perform chemical 
labelling of proteins, and allowing development of assays of biological 
processes in intact living cells. 

US 5491084 describes the use of GFP as a biological reporter. Early 
applications of GFP as a biological reporter (Chalfie et al. Science, (1994), 
263 , 802-5; Chalfie, et al, Photochem.Photobiol., (1995), 62(4), 651-6) used 
wild type (native) GFP (wtGFP), but these studies quickly demonstrated two 
areas of deficiency of wtGFP as a reporter for use in mammalian cells. 
Firstly, the protein being derived from a poikilothermic marine organism does 
not undergo protein folding efficiently when expressed in mammalian cells 
cultured at 37 °C, resulting in weak fluorescence. Secondly, the spectral 
characteristics of the wtGFP are not ideally suited to use as a cellular 
reporter, requiring excitation with electromagnetic radiation in the near-UV 
range, which is potentially damaging to living cells. 

Consequently, significant effort has been expended to produce variant 
mutated forms of GFP with properties more suitable for use as an 
intracellular reporter. 

A number of mutated forms of GFP with altered spectral properties 
have been described. A variant-GFP (Helm et al. (1994) Proc.Natl.Acad.Sci. 
91, 12501) contains a Y66H mutation which blue-shifts the excitation and 
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emission spectrum of the protein. However, this protein is only weakly 
fluorescent and requires potentially damaging UV excitation. 

A further mutant of GFP (Heim et al. Nature, (1995), 373 , 663-664) 
5 contains a S65T mutation which red-shifts the optimum excitation and 

emission wavelengths relative to wtGFP and which is 4-6 fold brighter than 
WtGFP when expressed as a recombinant protein at 25 ^C- However, this 
variant does not yield bright fluorescence when expressed in hosts cultured 
at 37 

10 

Ehrig et al (FEBS Lett., (1995), 367 ,163-6) describe two mutations of 
GFP, T203I and E222G, which individually delete one of the excitation 
maxima of wtGFP. The E222G mutation deletes the near-UV excitation peak 
at 395 nm and produces a red-shift in the excitation peak at 475 nm to 481 
15 nm. The emission peak for this mutant protein is at 506 nm. 

W096/27675 describes two variant GFPs, obtained by random 
mutagenesis and subsequent selection for brightness, which contain the 
mutations V163A and V163A-hS175G, respectively. These variants were 
20 shown to produce more efficient expression in plant cells relative to wtGFP 
and to increase the thermotolerance of protein folding. The double mutant 
V163A-hS175G was observed to be brighter than the variant containing the 
single V163A mutant alone; however this mutant exhibits an undesirable 
_ blue-shifted excitation peak. 

25 

A further mutant, termed cycle-3, generated by molecular evolution 
through DNA shuffling (Crameri, A. et al. Nature Biotechnology, (1996), 14 , 
315-9) is available commercially from Invitrogen Inc. Cycle-3-GFP contains 
three mutations (F99S-I-M1 53T-f-V1 63A) which increase whole cell 
30 fluorescence approximately 42 fold when compared with wtGFP. However, 
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this mutant retains the near-UV excitation maximum of the wtGFP, maicing it 
less suitable as a reporter for use in living cells. 

The above mutations effectively address some of the spectral 
5 deficiencies of wtGFP as a biological reporter in providing variant forms of 
GFP which are compatible with lower energy excitation and which emit at 
wavelengths compatible with detection instrumentation commonly in use for 
measuring biological reporters. However, such mutations do not address the 
problem of inefficient folding and chromophore formation when wtGFP or 
10 spectral variants are expressed in hosts requiring growth at temperatures 
significantly greater than ambient. 

US 6172188 describes variant GFPs wherein the amino acid in 
position 1 preceding the chromophore has been mutated to provide an 

15 increase of fluorescence intensity. Such mutations include F64I, F64V, 
F64A, F64G and F64L, with F64L being the preferred mutation. These 
mutants result in a substantial increase in the intensity of fluorescence of 
GFP without shifting the excitation and emission maxima. F64L-GFP has 
been shown to yield an approximate 6-fold increase in fluorescence at 37 

20 due to shorter chromophore maturation time. 

In addition to the single mutants or randomly derived combinations of 
mutations described above, a variety of mutant-GFPs have been created 
which contain two or more mutations deliberately selected from those 
25 described above and other mutations, and which seek to combine the 

advantageous properties of the individual mutations to produce a protein with 
expression and spectral properties which are suited to use as a sensitive 
biological reporter in mammalian cells. 

30 One mutant, commonly termed EGFP, available commercially from 

Clontech Inc., contains the mutations F64L and S65T (Cormack, B.P. et al. 
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Gene, (1996), 1 73 , 33-38). These mutations when combined, confer an 
approximate 35-fold increase in brightness over wtGFP and the spectral 
characteristics permit excitation and detection of EGFP with commonly used 
fluorescein excitation (488 nm) and emission filters (505 nm-530 nm). 
5 EGFP has been optimised for expression in mammalian systems, having been 
constructed with preferred mammalian codons. 

US 6194548 discloses GFPs with improved fluorescence and folding 
characteristics at 37 °C that contain, at least, the changes F64L and VI 63A 
10 and SI 75G. A further mutant GFP containing the F64L, S65T and VI 63A 
mutations has been described (Cubitt, A.B. et al. Methods in Cell Biology, 
(1999), 58, 19-29). 



US 6077707 describes a blue fluorescent protein (BFP) containing the 
15 F64L mutation in combination with Y66H and US 6194548 describes a 
further BFP containing the F64L, Y66H, Y145F and L236R substitutions. 

The present invention provides novel engineered derivatives of green 
fluorescent protein (GFP) which exhibit enhanced fluorescence relative to 

20 WtGFP when expressed in non-homologous cells at temperatures above 

30^C, and when excited at about 490 nm compared to the parent proteins, 
i.e. WtGFP. Mutant GFPs according to the invention provide a means for 
detecting GFP reporters In mammalian cells at lower levels of expression 
and/or increased sensitivity relative to wtGFP. This greatly improves the 

25 usefulness of fluorescent proteins in studying cellular functions in living cells. 
The multiply-mutated GFPs of this invention have fluorescence properties 
which are not predictable from the properties of the individual mutations 
when studied in isolation. Furthermore, it has surprisingly been found that 
certain GFPs according to the present invention, which do not contain any 

30 mutations in the chromophore region relative to wtGFP, exhibit enhanced 
fluorescence compared with mutant GFPs described previously. 
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In a first aspect of the invention, there is provided a fluorescent 
protein which is derived from Green Fluorescent Protein (GFP) or any 
functional GFP analogue and has an amino acid sequence which is modified 
5 by amino acid substitution compared with the amino acid sequence of wild 
type Green Fluorescent Protein said modified fluorescent protein comprising: 

i) an amino acid substitution at position F64; 

ii) a single amino acid substitution at a position selected from the group 
consisting of positions S65 and E222; and 

10 ill) an amino acid substitution at position SI 75; 

wherein said modified GFP has a different excitation spectrum and/or 
emission spectrum compared with wild type GFP. 

Suitably, the amino acid F at position 64 may be substituted by an 
15 amino acid selected from the group consisting of L, I, V, A and G, thereby 
providing F64U F64I, F64V, F64A, or F64G substitutions. In a preferred 
embodiment of the first aspect, the amino acid F is substituted by L at 
position 64. 

20 Suitably, the amino acid S at position 1 75 may be substituted by an 

amino acid selected from the group consisting of G, A, L, I and T, thereby 
providing S175G, S175A, S175L, SI 751 and S175T substitutions. In a 
preferred embodiment of the first aspect, the amino acid S is substituted by 
G at position 175. 

25 

In embodiments where the amino acid S at position 65 is substituted, 
it is suitably substituted by an amino acid selected from the group consisting 
of G, A, U C, V, I and T, thereby providing S65G, S65A, S65L, S65C, 
S65V, S65I or S65T substitutions. Preferably, the amino acid substitution at 
30 position 65 is the S65T substitution. 
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In embodiments where the amino acid E at position 222 is substituted, 
it is suitably substituted by an amino acid selected from the group consisting 
of G, A, V, L, I, F, S, T, N and Q, thereby providing E222G, E222A, E222V, 
E222U E222I, E222F, E222S, E222T, E222N or E222Q substitutions. 
5 Preferably, the amino acid substitution at position 222 is the E222G 
substitution. 

Suitably, the novel fluorescent proteins exhibit high fluorescence in 
cells expressing them when said cells are incubated at a temperature of 30 
10 or above, preferably at a temperature of from 32 to 39 °C, more 

preferably from 35 °C to 38 °C and most preferably at a temperature of 
about 37 ^C. 

Preferably, the fluorescent protein according to the first aspect has an 
15 amino acid sequence which is modified by amino acid substitution compared 
with the amino acid sequence of wild type Green Fluorescent Protein having 
the sequence: SEQ ID No.2- 

A preferred protein according to the present invention Is a protein in 
20 which, in relation to SEQ ID No. 2 of GFP, the amino acid F at position 64 has 
been substituted by L, the amino acid S at position 1 75 has been substituted 
by G and the amino acid E at position 222 has been substituted by G, and is 
shown herein as having the amino acid sequence as set forth in SEQ ID 
No.3- 

25 

An alternative preferred protein according to the present invention is a 
protein in which, in relation to SEQ ID No, 2 of GFP, the amino acid F at 
position 64 has been substituted by L, the amino acid S at position 65 has 
been substituted by T and the amino acid S at position 1 75 has been 
30 substituted by G, and is shown herein as having the amino acid sequence as 
set forth in SEQ ID No.4. 
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Suitably, the GFP or functional GFP-analogue used to derive the 
fluorescent protein may be obtained from any convenient source. For 
example, native GFP derived from species of the genus Aequorea, suitably 
5 Aequorea victoria. The chromophore in wtGFP from Aequorea victoria is at 
positions 65-67 of the predicted primary amino acid sequence (SEQ ID No.2)- 
In a preferred embodiment, the GFP is derived from Aequorea victoria. 

The modified proteins of the present invention may be produced by 

10 introducing mutations in a sequence of the nucleic acid that encodes the 

protein. As used herein, a preferred sequence of the gene encoding wtGFP is 
derived from Aequorea victoria, published by Chalfie et al, (Science, (1994), 
263 , 802-5) disclosed as SEQ ID No.1 {Figure 1). The corresponding amino 
acid sequence is shown in SEQ ID No.2 (Figure 2). Alternative sequences of 

15 the GFP gene may be used, for example, the nucleotide (and predicted amino 
acid) sequences of the GFP gene described by Prasher et al, (Gene (1992), 
111 , 229) and the sequences as disclosed in WO 97/1 1094. In addition, 
alternative gene sequences that encode the fluorescent protein may 
incorporate a consensus Kozak nucleotide sequence (Kozak, M., Cell (1986), 

20 44 , 283), or preferred mammalian codons, to provide improved translation in 
mammalian systems. The nucleotide sequence corresponding to the 
fluorescent protein may also encode useful restriction enzyme sites and 
additional elements such as target sites for enzymes and purification tags. 
Methods for incorporation of a Kozak region, preferred mammalian codons, 

25 restriction enzyme sites, enzyme sites and purification tags are well known in 
the art and may result in the incorporation of amino acid residues and a 
change in numbering of amino acid residues in the fluorescent protein relative 
to the WtGFP numbering in the sequence provided. 

30 Herein, the abbreviations used for the amino acids are those stated in 

J.BioLChem., (1968), 243, 3558. 
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In a second aspect of the invention, thisre is provided a fusion 
compound comprising a protein of interest fused to a fluorescent protein 
which is derived from Green Fluorescent Protein (GFP) or any functional GFP 
5 analogue and has an amino acid sequence which is modified by amino acid 
substitution compared with the amino acid sequence of wild type Green 
Fluorescent Protein said modified fluorescent protein comprising: 

i) an amino acid substitution at position F64; 

ii) a single amino acid substitution at a position selected from the group 
10 consisting of positions S65 and E222; and 

iii) an amino acid substitution at position S175; 

wherein said modified GFP has a different excitation spectrum and/or 
emission spectrum compared with wild type GFP. 

15 In the context of the present invention, the term "protein of interest" 

is intended also to encompass polypeptides and peptide fragments. 
Examples of such proteins of interest include: NFkB and subunits thereof, 
RAC1, PLC domains, MAPKAP2, PKC, Cytochrome C, RHO, (3-actin, STAT6, 
protein kinase C isotypes, LAMP1/2 TGN, ATP7A TGN and GLUT4. 

20 

In a third aspect of the present invention there is provided a nucleic 
acid molecule comprising a nucleotide sequence encoding a fluorescent 
protein which is derived from Green Fluorescent Protein (GFP) or any 
functional GFP analogue and has an amino acid sequence which is modified 
25 by amino acid substitution compared with the amino acid sequence of wild 
type Green Fluorescent Protein said modified fluorescent protein comprising: 

i) an amino acid substitution at position F64; 

ii) a single amino acid substitution at a position selected from the group 
consisting of positions S65 and E222; and 

30 iii) an amino acid substitution at position SI 75; 
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wherein said modified GFP has a different excitation spectrum and/or 
emission spectrum compared with wild type GFP. 

Preferably, the nucleic acid molecule according to the third aspect 
5 encodes a fluorescent protein having an amino acid sequence which is 

modified by amino acid substitution compared with the amino acid sequence 
of wild type Green Fluorescent Protein having the sequence: SEQ ID No. 2. 

In a particular embodiment of the third aspect, the nucleic acid 
10 molecule comprises a nucleotide sequence encoding a fluorescent protein 

derived from Green Fluorescent Protein (GFP) or any functional GFP analogue 
according to the invention fused to a nucleotide sequence encoding a protein 
of interest. 

15 Preferably, the nucleic acid molecule is a construct comprising a DNA 

sequence. 

Preferably, the nucleic acid molecule encodes a fluorescent protein 
having an amino acid sequence selected from the group consisting of SEQ ID 
20 No. 3 and SEQ ID No.4. 

As is well known, a single amino acid may be encoded by more than 
one nucleotide codon and thus each of the above nucleotide sequences may 
be modified to produce an alternative nucleotide sequence that encodes the 

25 same peptide. Thus, the preferred embodiments of the invention include 
alternative DNA sequences that encode the preferred proteins as previously 
described. It is to be understood that the preferred proteins (and the nucleic 
acid sequences from which they are derived), may include additional 
residues, particularly N- and C-terminal amino acids, or 5'- or S'-nucleotide 

30 sequences, and still be essentially as described herein. 
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Suitably, the DNA construct encoding the novel fluorescent proteins 
may be prepared synthetically by established methods, e.g. the 
phosphoramidite method described by Beaucage and Caruthers, (Tetrahedron 
Letters (1981), 22, 1859-1869), or the method described by Matthes et al., 
5 (EMBO Journal (1984), 3, 801-805). According to the phosphoramidite 
method, oiigonucieotldes are synthesized, e.g. in an automatic DNA 
synthesizer, purified, annealed, ligated and cloned into suitable vectors. 



The DNA construct encoding the fluorescent protein may also be 
10 prepared by recombinant DNA methodology, for example cDNA cloning. See 
for example, Sambrook, J. et al (1989) Molecular Cloning - A Laboratory 
Manual, Cold Spring Harbor Laboratory Press. 

The DNA construct may also be prepared by polymerase chain reaction 
15 (PCR) using specific primers, for instance as described in US 4683202 or by 
Saiki et al (Science (1988), 239 , 487-491). A recent review of PCR 
methods may be found in PCR Protocols, (1990), Academic Press, San 
Diego, California, USA. 



20 The gene sequence encoding the fluorescent protein may be joined in- 

frame with a gene encoding the protein of interest and the desired fusion 
protein produced when inserted into an appropriate expression vector. For 
example, polymerase chain reaction or complementary oligonucleotides may 
be employed to engineer a polynucleotide sequence corresponding to the 

25 fluorescent protein, 5' or 3' to the gene sequence corresponding to the 
protein of interest. Alternatively, the same techniques may be used to 
engineer a polynucleotide sequence corresponding to the fluorescent protein 
sequence 5' or 3' to the multiple cloning site of an expression vector prior to 
insertion of a gene sequence encoding the protein of interest. The 

30 polynucleotide sequence corresponding to the fluorescent protein sequence 
may comprise additional nucleotide sequences to include cloning sites. 
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linkers, transcription and translation initiation and/or termination signals, 
labelling and purification tags. 

In a fourth aspect, there is provided an expression vector comprising 
5 suitable expression control sequences operably linked to a nucleic acid 
molecule according to the present invention. The DNA construct of the 
invention may be inserted into a recombinant vector, which may be any 
vector that may conveniently be subjected to recombinant DNA procedures. 
The choice of vector will often depend on the host cell into which it is to be 
10 introduced. Thus, the vector may be an autonomously replicating vector, ie. 
a vector which exists as an extrachromosomal entity, the replication of 
which is independent of chromosomal replication, eg. a plasmid. 
Alternatively, the vector may be one which, when introduced into a host cell, 
is integrated into the host cell genome and replicated together with the 
15 chromosome{s) into which it has been integrated. 

The vector is preferably an expression vector in which the DNA 
sequence encoding a fluorescent protein of the invention is operably linked to 
additional segments required for transcription of the DNA. In general, the 
20 expression vector is derived from plasmid or viral DNA, or may contain 

elements of both. The term, "operably linked" indicates that the segments 
are arranged so that they function in concert for their intended purposes, e.g. 
transcription initiates in a promoter and proceeds through the DNA sequence 
coding for the fluorescent protein of the invention. 

25 

The promoter may be any DNA sequence which shows transcriptional 
activity in a suitable host cell of choice, (eg. a bacterial cell, a mammalian 
cell, a yeast cell, or an insect cell) for expressing a fluorescent protein. The 
promoter may be derived from genes encoding proteins either homologous or 
30 heterologous to the host cell. 
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Examples of suitable promoters for directing the transcription of the 
DNA sequence encoding the fluorescent protein of the invention in 
mammalian cells are the CMV promoter (US 5168062, US5385839), 
Ubiquitin C promoter (Wulff, M. et al., FEBS Lett. (1990), 261 , 101-105), 

5 SV40 promoter (Subramani et ah, Mol. Cell Biol. (1981), 1, 854-864) and 
MT-1 (metallothionein gene) promoter (Palmiter et aL, Science (1983), 222 , 
809-814). An example of a suitable promoter for use in insect cells is the 
polyhedrin promoter (US 4745051; Vasuvedan et aL, FEBS Lett., (1992) 
31 1 , 7-1 1). Examples of suitable promoters for use in yeast host cells 

10 include promoters from yeast glycolytic gienes (Hitzeman et aL, J. Biol. 

Chem., (1980), 255 , 12073-12080; Alber and Kawasaki, J. Mol. Appl. Gen., 
(1982), 419-434) or alcohol dehydrogenase genes (Young et aL, in 
Genetic Engineering of Microorganisms for Chemicals (Hollaender et ai, eds.). 
Plenum Press, New York, 1982), or the TPI1 (US 459931 1) or ADH2-4c 

15 (Russell et aL, Nature, (1983), 304, 652-654) promoters. 

Examples of suitable promoters for use in bacterial host cells include 
the promoter of the Bacillus stearothermophilus maltogenic amylase gene, 
the Bacillus licheniformis alpha-amylase gene, the Bacillus amyloliquefaciens 
20 BAN amylase gene, the Bacillus subtilis alkaline protease gene, or the Bacillus 
pumilus xylosidase gene, or the phage Lambda PR or PL promoters or the 
Escherichia coll lac, trp or tac promoters. 



The DNA sequence encoding the novel fluorescent proteins-of the 
25 invention may also, if necessary, be operably connected to a suitable 

terminator, such as the human growth hormone terminator (Palmiter et aL, 
op. cit.) or (for fungal hosts) the TPI1 (Alber and Kawasaki, op. oit.) or ADH3 
(McKnight et aL, op. cit.) terminators. The vector may further comprise 
elements such as polyadenylation signals (e-g- from SV40 or the adenovirus 
30 5 Elb region), transcriptional enhancer sequences (e,g. the SV40 enhancer) 
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and translational enhancer sequences (e.g. the ones encoding adenovirus VA 
RNAs). 

The vector may further comprise a DNA sequence enabling internal 
ribosomal entry and expression of two proteins from one bicistronic 
transcript mRNA molecule. For example, the internal ribosomal entry 
sequence from the encephalomyocarditis virus {Rees S, et al, BioTechniques 
(1 996), 20, 1 02-1 1 0 and US 49371 90). 



The recombinant vector may further comprise a DNA sequence 
enabling the vector to replicate in the host cell in question. An example of 
such a sequence (when the host cell is a mammalian cell) is the SV40 origin 
of replication. 

When the host cell is a yeast cell, examples of suitable sequences 
enabling the vector to replicate are the yeast plasmid 2|li replication genes 
REP 1-3 and origin of replication. 

The vector may also comprise selectable markers, such as a gene that 
confers resistance to a drug, e.g. ampicillin, kanamycin, tetracycline 
chloramphenicol, puromycin, neomycin or hygromycin. 

The procedures used to ligate the DNA sequences coding for the 
fluorescent protein of the invention, the promoter and optionally the 
terminator and/ or targeting sequence, respectively, and to insert them into 
suitable vectors containing the information necessary for replication, are well 
known to persons skilled in the art (eg. Sambrook et al., op.cit.). 

In a fifth aspect of the invention, there is provided a host cell 
transformed or transfected with a DNA construct comprising an expression 
vector according to the present invention. 
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The DNA construct or the recombinant vector of the invention is 
suitably introduced into a host cell which may be any cell which is capable of 
expressing the present DNA construct and includes bacteria, yeast and 
5 higher eukaryotic cells (Unger, T.F., The Scientist (1997), H{17), 20-23; 
Smith, C-, The Scientist (1998), 12(22): 20; Smith, C, The Scientist (1998), 
_12^(3), 18; Fernandez, J.M. & Hoeffler, J. P., Gene Expression Systems- using 
nature for the art of expression. Academic Press 1999). 

Examples of bacterial host cells which, on cultivation, are capable of 
expressing the DNA construct of the invention are Gram-positive bacteria, 
eg. species of Bacillus or Gram-negative bacteria such as E. colL The 
transformation of the bacteria may be effected by using competent cells in a 
manner known per se (cf. Sambrook et al., supra). 

Examples of suitable mammalian cell lines are the HEK293 and the 
HeLa cell lines, primary cells, and the COS (e.g. ATCC CRL 1650), BHK (eg. 
ATCC CRL 1632, ATCC CCL 10), CHL (e.g. ATCC CCL39) or CHO (eg. 
ATCC CCL 61) cell lines. Methods of transfecting mammalian cells and 
expressing DNA sequences introduced in the cells are described in eg. 
Kaufman and Sharp, J. MoL Biol., (1982), 159 , 601-621; Southern and 
Berg, J. MoL Appl. Genet., (1982), 1, 327-341; Loyter et al., Proc. Natl. 
Acad. Sci. USA, (1982), 79, 422-426; Wigler et al.. Cell, (1978), 14, 725; 
Corsaro and Pearson, Somatic Cell Genetics, (1981), 7, 603, Graham and 
van der Eb, Virology (1973), 52, 456; and Neumann et aL, EMBO J., (1982), 
1, 841-845. 

Examples of suitable yeast cells include cells of Saccharomyces spp. or 
Schizosaccharomyces spp., in particular strains of Saccharomyces cereyisiae 
30 or Saccharomyces kluyverL Methods for transforming yeast cells with 
heterologous DNA and producing heterologous polypeptides therefrom are 



10 



15 



20 



25 
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described, e.g. in US 4599311, US 4931373, US 4870008, US 5037743, 
and US 4845075, all of which are hereby incorporated by reference. 
Transformed cells are selected by a phenotype determined by a selectable 
marker, commonly drug resistance or the ability to grow in the absence of a 
5 particular nutrient, e.g. leucine. A preferred vector for use in yeast is the 
POT1 vector disclosed in US 4931373. The DNA sequence encoding the 
fluorescent protein of the invention may be preceded by a signal sequence 
and optionally a leader sequence , e.g. as described above. Further examples 
of suitable yeast cells are strains of Kluyveromyces , such as K. lactis, 
10 Hansenula, e.g. H. polymorpha, or Pichia, e.g. P. pastoris (cf- Gleeson et al., 
J. Gen. Microbiol., (1986), 132, 3459-3465; US 4882279). 

Transformation of insect cells and production of heterologous 
polypeptides therein may be performed as described in US 4745051; US 

15 4879236; US 5155037; US 5162222; EP 397485, all of which are 

incorporated herein by reference. The insect cell line used as the host may 
suitably be a Lepidoptera cell line, such as Spodoptera frugiperda cells or 
Trichoplusia ni cells (cf. US 5077214). Culture conditions may suitably be as 
described in, for instance, WO 89/01029 or WO 89/01028, or any of the 

20 aforementioned references. 

In a sixth aspect, the invention provides a method for preparing a 
Green Fluorescent Protein (QFP) or a functional GPP analogue according to 
the present invention, the method comprising cultivating a host cell 
25 transformed or transfected with a nucleotide sequence according to the 
invention and obtaining therefrom the polypeptide expressed by said 
nucleotide sequence. 



30 



Suitably, the transformed or transfected host cells as described above 
are cultured in a suitable nutrient medium under conditions permitting the 
expression of a DNA construct according to the invention, after which the 
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cells may be used in the screening method of the invention. Alternatively, 
the cells may be disrupted after which cell extracts and/or supernatants may 
be analysed for fluorescence and/ or used to purify the GFP or functional GFP 
analogue of the invention. 

5 

The medium used to culture the cells may be any conventional medium 
suitable for growing the host cells, such as minimal or complex media 
containing appropriate supplements. Suitable media are available from 
commercial suppliers or may be prepared according to published protocols 
10 (eg. in catalogues of the American Type Culture Collection; Sambrook et aL, 
supra). 



For example, a fusion protein comprising glutathione S-transferase 
(GST) and GFP can be constructed and expressed in £. coli\ The GFP may be 
15 joined in-frame to the C-terminus of GST in a pGEX plasmid vector 

(Amersham Pharmacia Biotech). Recombinant production of the fusion 
protein is carried out utilising a standard E. coli expression host, followed by 
purification employing glutathione affinity chromatography and removal of 
the GST tag by proteolytic cleavage. 

20 

In a seventh aspect of the present invention, there is provided a 
method of measuring the expression of a protein of interest in a cell. The 
method comprises: i) introducing into a cell a nucleic acid molecule 
comprising a nucleotide sequence encoding a fluorescent protein which is 

25 derived from Green Fluorescent Protein (GFP) or any functional GFP analogue 
according to the present invention said nucleic acid molecule being operably 
linked to and under the control of an expression control sequence which 
moderates expression of said protein of interest; ii) culturing the cell under 
conditions suitable for the expression of the protein of interest; and iii) 

30 detecting the fluorescence emission of the Green Fluorescent Protein (GFP) or 
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a functional GFP analogue as a means of measuring the expression of the 
protein of interest. 

In an eighth aspect of the present invention, there is provided a 
5 method of determining the cellular and/or extracellular localisation of a 
protein of interest which method comprises: 

i) introducing into a cell a nucleic acid molecule comprising a nucleotide 
sequence encoding a Green Fluorescent Protein (GFP) or a functional GFP 
analogue according to the invention fused to a nucleotide sequence encoding 

10 a protein of interest, said nucleic acid molecule being operably linked to and 
under the control of a suitable expression control sequence; 

ii) culturing said cell under conditions suitable for the expression of said 
protein of interest; and 

iii) determining the cellular and/or extracellular localisation of said protein 
15 of interest by detecting the fluorescence emission by optical means. 

The fluorescent proteins of the present invention may also be used in a 
method to detect and compare the effect of a test substance on the 
regulation of expression and/or translocation of two or more different 

20 proteins of interest in a cell. Alternatively, they may be used in a method to 
compare the expression of a protein of interest and the simultaneous activity 
of an expression control sequence in response to a test substance. The 
fluorescent proteins may also be used in a method to compare the activity of 
two or more expression control sequences in a cell in response to a test 

25 substance. Such methods may be performed in the presence and in the 

absence of a test substance whose effect on the process is to be measured. 
For example, one detectable reporter molecule may be used as an internal 
reference and another as a variable marker, since regulated expression of a 
gene can be monitored quantitatively by fusion of an expression control 

30 sequence to a DNA construct encoding, eg. F64L-S1 75G-E222G-GFP, 
measuring the fluorescence, and normalising it to the fluorescence of a 
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constitutively expressed spectrally distinct fluorescent molecule. The 
constitutively expressed spectrally distinct fluorescent molecule, for example 
BFP, acts as an internal reference. 

5 Thus, In a ninth aspect of the present invention, there is provided a 

method of comparing the effect of one or more test substance(s) on the 
expression and/or localisation of one or more different protein(s) of interest in 
a cell which method comprises: 

i) introducing into a ceil: 

10 a) a nucleic acid molecule comprising a nucleotide sequence encoding a 
Green Fluorescent Protein (GPP) or a functional GPP analogue according to 
the invention optionally fused to a nucleotide sequence encoding a first 
protein of interest, said nucleic acid molecule being operably linked to and 
under the control of a first expression control sequence; and optionally, 

15 b) at least one different nucleic acid molecule encoding a protein reporter 
molecule optionally fused to a different protein of interest, each said nucleic 
acid molecule being operably linked to and under the control of a second 
expression control sequence wherein said protein reporter molecule has or is 
capable of generating an emission signal which is spectrally distinct from that 

20 of said Green Fluorescent Protein (GFP) or a functional GFP analogue; 

ii) culturing said cells under conditions suitable for the expression of said 
protein(s) of interest in the presence and absence of said test substance(s); 

iii) determining the expression and/or localisation of said protein{s) of 

- interest in said cells by detecting the fluorescence emission by optical means; 
25 and 

iv) comparing the fluorescence emission obtained in the presence and 
absence of said test substance(s) to determine the effect of said test 
substance(s) on the expression and/or localisation of said protein{s) of 
interest. 



30 
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In a preferred embodiment of the ninth aspect, samples of said cells in 
a fluid medium are introduced into separate vessels for each of said test 
substances to be studied. 

5 Preferably, the first and second expression control sequences are 

different. 

Suitably, the protein reporter molecule may be selected from the group 
consisting of fluorescent proteins and enzymes- Preferred fluorescent 

10 proteins are those which have a spectrally distinguishable emission 

wavelength compared with the emission wavelength of the fluorescent 
proteins according to the present invention, for example, BFP. Suitable 
enzyme reporters are those which are suitable for generating a detectable 
{eg. a luminescent or fluorescent) signal in a substrate. Suitable 

15 enzyme/substrates include: luciferase/luciferin; p-galactosidase/DDAO 

galactoside; p-galactosidase/fluorescein di-p-D-galactopyranoside; alkaline 
phosphatase/Attophos. 

In the methods of the invention, the fluorescence of cells transformed 
20 or transfected with the DNA construct according to the invention may 

suitably be measured by optical means in for example; a spectrophotometer, 
a fluorimeter, a fluorescence microscope, a cooled charge-coupled device 
(CCD) imager (such as a scanning imager or an area imager), a fluorescence 
activated cell sorter, a confocal microscope or a scanning confocal device, 
25 where the spectral properties of the cells in culture may be determined as 
scans of light excitation and emission. 



30 



The fluorescent proteins of the present invention have many additional 
applications, for example: 
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i) Use as a non-toxic marker for selection of transfected cells containing 
an expression vector encoding at least the fluorescent protein of the 
invention. The fluorescent emission may be used to isolate transfected cells 
thereby overcoming the need for selection with toxic molecules such as 

5 antibiotics. 

ii) Use as a protein label in living and fixed cells. The novel proteins 
exhibit strong fluorescence and are a suitable label for proteins present at 
low concentrations. Since no substrate is needed and visualization of the 

10 fluorescent protein does not damage the cells, dynamic analysis can be 
performed. 

iii) Use as a marker in cell or organelle fusion. By labelling one or more 
cells or organelles with the novel proteins, for example, F64L-S1 75G-E222G- 

15 GFP, and other cells or organelles with same or another fluor, fusions such as 
heterokaryon formation can be monitored. 

iv) Translocation of proteins fused to the novel proteins of the invention 
can be visualised. The translocation of intracellular proteins to a specific 

20 organelle can be visualised by fusing the protein of interest to a fluorescent 
protein, for example, F64L-S1 75G-E222G-GFP and labelling the organelle 
with another fluorescent molecule, eg. fluorescent protein. Translocation can 
then be detected as a spectral shift in the fluorescent proteins in the specific 
organelle. 

25 

v) Use as a secretion marker. By fusion of a fluorescent protein of the 
invention to a signal peptide or a peptide to be secreted, secretion may be 
followed in living cells. 

30 vi) Use as genetic reporter or protein tag in transgenic animals. Due to 
the strong fluorescence of the novel proteins, they are suitable as tags for 
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proteins and gene expression, since the signal to noise ratio is significantly 
improved over the prior art proteins, such as wild-type GFP. 

vii) Use as a cell or organelle integrity marker. By expressing the novel 
proteins targeted to an organelle, it is possible to calculate the leakage of the 
protein and use that as a measure of cell integrity. 

viii) Use as a transfection marker, and as a marker to be used in 
combination with FACS sorting (eg. as described in Example 3). Due to the 
increased brightness of the novel proteins the quality of cell detection and 
sorting can be significantly improved. 

ix) Use as real-time probe working at near physiological concentrations. 
Since the novel proteins of the present invention are significantly brighter 
than wtQFP when expressed in cells at about 37 °C and excited with light at 
about 490 nm, the concentration needed for visualization can be lowered. 
Target sites for enzymes engineered into the novel proteins, for example 
F64L-S175G-E222G-GFP, can therefore be present in the cell at low 
concentrations in living cells. This is important for two reasons: i) the probe 
must interfere as little as possible with the intracellular process being 
studied; and ii) the translational and transcriptional apparatus should be 
stressed minimally. 

x) Transposon vector mutagenesis can be performed using the novel 
proteins as markers in transcriptional and translational fusions. Transposons 
may be used in microorganisms encoding the novel proteins. The transposons 
may be constructed for translational and transcriptional fusion to be used for 
screening for promoters. Transposon vectors encoding the novel proteins, 
for example F64L-S1 75G-E222G-GFP, can be used for tagging plasmids and 
chromosomes- 
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xi) Use as a reporter for bacterial detection by introducing the novel 
proteins into the genome of bacteriophages. By engineering the novel 
proteins, for example F64L-S1 75G-E222G-GFP, into the genome of a phage 
a diagnostic tool can be designed. F64L-S1 75G-E222G-GFP will be 
expressed only upon transfection of the genome into a living host. The host 
specificity is defined by the bacteriophage. 

The invention is further illustrated by reference to the following 
examples and figures in which: 

Figure 1 is the nucleotide Sequence of wtGFP (Chalfie et al. Science, (1994), 
263, 802-5) and referred to herein as SEQ ID No.1 . 

Figure 2 is the corresponding amino acid sequence of wtGFP (Chalfie et al. 
Science, (1994), 263, 802-5) and referred to herein as SEQ ID No. 2. 
Figure 3 is the predicted amino acid sequence of F64L-S1 75G-E222G-GFP 
and referred to herein as SEQ ID No. 3. 

Figure 4 is the predicted amino acid sequence of F64L-S65T-S1 75G-GFP and 
referred to herein as SEQ ID No. 4. 

Figure 5 is a plot showing average fluorescence intensities of mutant GFPs 
according to the invention. 

Figure 6 is a plot showing relative photodegradation of mutant GFPs 
according to the invention. 

Figure 7 is a plot demonstrating the increase in the ratio of nuclear to 
cytoplasmic fluorescence intensity on translocation of P65-GFP from the 
cytoplasm to the nucleus of CHO-hir cells following agonist addition. 
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EXAMPLES 



1 . Cloning of GFP gene and template vector construction 



The GFP gene used in the present study was contained within the 
plasmid pGFP (Chalfie et aL, Science, (1994), 263 , 802-805; GenBank 
accession number U17997) obtained from Clontech Laboratories Inc. (Palo 
Alto, Ca, USA). The gene was amplified by PGR using Pfu polymerase 
10 (Promega, Madison, Wl, USA) according to recognised protocols (Saiki et aL, 
Science, (1988), 239, 487-491). The sequences of primers used were: 



GFP-1 5'-ggtacgggccgccaccatgagtaaaggagaagaacttttcac SEQ ID No. 5 

GFP-2 5'-ggtacgggttaaccggttttgtatagttcatccatg SEQ ID No.6 

GFP-3 5'-ggtacgggccgccaccatgggatGcaaaggagaagaacttttcac SEQ ID No. 7 

Primer GFP-1 exhibits homology to the 5' region of the GFP gene and 
15 contains a partial Kozak site (Kozak, M, Cell, (1986), 44, 283) prior to the 
start codon for efficient initiation of translation in mammalian systems. 
Primer GFP-2 exhibits homology to the 3' region of the GFP gene and 
contains an additional Age\ restriction enzyme site immediately prior to the 
stop codon to facilitate cloning of proteins by fusion to the C-terminus of 
20 GFP. Primer GFP-3 is similar to primer GFP-1 exhibiting homology to the 5' 
region of the GFP gene, but contains an additional restriction site {BamH\) 
immediately after the initiation codon to facilitate cloning of proteins by 
fusion to the N-terminus of GFP. Amplified products resulting from PGR 
reactions containing primers GFP-1 and GFP-2, and GFP-3 and GFP-2 were 
25 tailed with a single 3'-deoxyadenosine using Taq polymerase (Amersham 
Pharmacia Biotech, Amersham, UK) and ligated into the TA cloning vector 
pTARGET (Promega) according to manufacturer's instructions. The correct 
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orientation relative to the CMV promoter and sequence of the insert was 
determined by automated DNA sequencing. 

2. Generation of mutants of GFP 

5 

The following mutants of GFP were generated in the present study: 
F64L-GFP, V163A-GFP, S175G-GFP, E222G-GFP, F64L-E222G-GFP, F64L- 
V163A-GFP, F64L-S175G-GFP, VI 63A-S1 75G-GFP, VI 63A-E222G-GFP, 
S175G-E222G-GFP, F64L-S1 75G-E222G-GFP, VI 63A-S1 75G-E222G-GFP, 

10 F64L-V1 63A-E222G-GFP, F64L-S65T-S1 75G-GFP, F64L-S65T-V1 63A-GFP, 
Mutants of the GFP gene (SEQ ID 3) construct within pTARGET (See 
Example 1) were generated using the QuikChange™ site-directed mutagenesis 
kit (Stratagene, La Jolla, Ca, USA) according to manufacturer's instructions. 
The sequences of primers used to generate F64U S65T, V163A, S175G and 

15 E222G single mutants have been documented in Table 1 . Multiply-mutated 
GFP molecules were generated through successive mutagenesis reactions. 
All GFP mutant sequences were verified by automated sequencing. 



Table 1 



Primer 


Mutation 


Sequence (5' - 3') 


SEQ ID 

No. 


GFP-64f 


F64L 


ccaacacttgtcactactctctcttatggtgttcaat 


8 


GFP-64r 


F64L 


attgaacaccataagagagagtagtgacaagtgttgg 


9 


GFP-65f 


S65T 


ccaacacttgtcactactctcacctatggtgttcaatgcttttca 


10 


GFP-65r 


S65T 


tgaaaagcattgaacaccataggtgagagtagtgacaagtgttgg 


1 1 


GFP-1 63f 


VI 63 A 


gacaaacaaaagaatggaatcaaagccaacttcaaaattagacac 


12 


GFP-1 63r 


VI 63 A 


gtgtctaattttgaagttggctttgattccattcttttgtttgtc 


13 


GFP-1 75f 


S175Q 


caacattgaagatggaggcgttcaactagcagacc 


14 


GFP-1 75r 


S175G 


ggtctgctagttgaacgcctccatcttcaatgttg 


15 


GFP-222f 


E222G 


ccacatggtccttcttggctttgtaacagctgctgg 


16 


GFP-222r 


E222G 


ccagcagctgttacaaagccaagaaggaccatgtgg 


17 



wo 02/085936 



-25- 



PCT/GBOl/04363 



3. Influence of individual mutations and combinations of F64L S65T, 
V163A, S175G and E222G mutations upon GFP vyhen expressed in 
mammalian cells 

Plasmid DNA to be used for transfection was prepared for all GFP and 
EGFP constructs using the HiSpeed plasmid purification kit (Qiagen, 
Westberg, NL). DNA was diluted to 100 ng. |liI-1 in 18-Megohm water 
(Sigma) and 1 ^ig used for transfections. For 50-80% confluency on the day 
of transfection, HeLa cells were plated at a density of 5x10Vwell in 6-well 
plates and incubated overnight. A 1:3 (1 p-g : 3 ratio of DNA to FuGene6 
reagent (Roche) was used for each transient transfection reaction; 3 |iil 
FuGene6 was added to 87 jxl serum-free DMEM medium (Sigma) (containing 
penicillin/streptomycin, L-glutamine (GibcoBRL) and gently tapped to mix, 
then 10 |Lil (1 iig) construct DNA was added and again gently mixed. The 
FuGene6:DNA complex was incubated at room temperature for 40 minutes 
then added dropwise directly to the cells without changing the medium, and 
the plates swirled for even distribution. 

Fluorescence measurements were made 24 or 48 hours after 
transfection. Briefly, the cells were washed in phosphate-buffered saline, 
released with the addition of 2 drops of Trypsin (GibcoBRL) and resuspended 
in 1 ml of complete DMEM medium (containing penicillin/streptomycin, L- 
glutamine and foetal bovine serum (Sigma). The cells were vortexed and 
analysed on a FAGS Calibur flow cytometer (Becton Dickinson & Co., NJ, 
USA) for characterisation of whole cell fluorescence, with excitation at 488 
nm and emission viewed with fluorescence filter set 530/30nm (range 515- 
545 nm). 10,000 events were collected for each transfection and 6-10 
replicates carried out for each construct. Average fluorescent intensities 
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from the FACS analysis were obtained as geometric means (mean 
fluorescence on log scale) and are shown In Figure 5. 

4. Purification of fluorescent proteins from £. coli 

5 

The gene for the mutant F64L-S175G-E222G-GFP (Example 2) was 
excised from pTARGET with fiamHI and Sa/\ and sub-cloned into the IPTG- 
inducible, GST-fusion vector pGEX-6P1 (Amersham Pharmacia Biotech). £. 
co// JM109 cells (Promega) containing an expression vector with the GST- 

10 GFP gene fusion were grown at 30°C to an OD6oo = 0.6 in 2x YT broth 

containing 100 \xg/m\ ampicillin. Protein expression was induced with IPTG 
(0.1 mM) and incubation continued for 16 hours. Cells were pelleted by 
centrifugation, resuspended in PBS and lysed by sonication (four 10 second 
bursts at 20 \xm with intermittent cooling on ice). Cellular debris was 

15 removed by centrifugation and the lysate containing soluble GST-GFP fusion 
protein was purified using glutathione sepharose columns (Amersham 
Pharmacia Biotech). Protein was then exchanged and eluted in PBS using a 
PD10 column (Amersham Pharmacia Biotech). The presence of a single band 
of correct molecular weight in the protein preparation was confirmed by SDS- 

20 PAGE using 4-12% Bis-Tris NuPAGE gel electrophoresis (Invitrogen). To 
assess protein concentration and purity, the protein preparation was 
subjected, in duplicate, to acid hydrolysis and filtration before amino acid 
analysis by ion exchange chromatography using a Pharmacia alpha plus 
series II analyser. 

25 

The extinction coefficient (Table 2) was determined on a UV/vis 
spectrometer (Unicam). Quantum yield (Table 2) was determined according 
to the method documented by Patterson et al (Biophysical Journal, (1997), 
73 , 2782-2790). Samples of equal optical density at respective absorbance 
30 maxima were prepared, and diluted, in lOmM Tris.HCI pH 8 for the purified 
GFP preparation and a fluorescein reference standard (Molecular Probes), 
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Fluorescence emission was nneasured in the region 490 - 600nm using a 
LS50B luminescence spectrometer (Perkin Elmer) and results for the GFP 
preparation were compared directly to those for the fluorescein standard 
(QY-0.85). 



Table 2 



Protein 


Absorbance 
peak (nm) 


Extinction coefficient 
(M-' cm ') 


Emission 
peak (nm) 


QY 


F64L-S175G- 
E222G-GFP 


481 


46213* 


506 


0.6* 



Mean of two measurements 



10 To evaluate the degree of photodegradation of the mutants F64L- 

S175G-E222G-GFP and F64L-E222G relative to wtGFP, 50ng of DNA was 
transfected into HeLa cells according to the method outlined in Example 3. 
For 50-80% confluency on the day of transfection, HeLa cells were plated at 
a density of 5x10^/well in a ViewPlate™-96 (Packard, Meriden CT, USA). 

15 Twenty-four hours after transfection, the cells were imaged live on a 
LEADseeker™ Cell Analysis System (Amersham Pharmacia Biotech) and 
bleached at high laser power (19.94mW) with a 488nm Argon laser 
(emission filter 535-45nm)- Thirty-two individual images were taken over 
260s with non-continuous illumination and all fluorescent proteins showed 

20 marked photodegradation as shown in Figure 6. 

5. Measurement of NFkB translocation 

NFkB is an activator of transcription and a component of signalling 
25 pathways which are responsive to a variety of inducers including cytokines, 
lymphokines, and some immunosuppressive agents. 
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The human NFkB P65 subunit gene (GenBank Accession number: 
M62399) was amplified using PGR according to recognised protocols (Saiki 
et al.. Science, (1988), 239 , 487-491). The sequences of primers used 
were: 

NFkB-1 5'-ttttactcgagatggacgaactgttccocctca SEQ ID No. 18 

NFkB-2 5'-ttttgaagcttggagctgatctgactcagcagg SEQ ID No. 19 



The P65 subunit was fused to the N terminus of GFP (SEQ ID No. 3) in 
the vector pCORONIOOO (Amersham Pharmacia Biotech), under the control 
of a CMV promoter. This was transfected into CHO-hir cells using FuGene6 
10 reagent (Roche) and standard transfection procedures and a stable cell line 
was produced containing the P65-GFP construct. 

CHO-hir, P65-GFP cells were seeded into 96 well microtitre plates at a 
conf luency of 5 x 1 0^ cells/well in DMEM media (Sigma) containing 

15 penicillin/streptomycin, L-glutamine (GibcoBRL) and incubated overnight at 
37 ^C. 1 hr before the assay was run, the media was removed and replaced 
with 100 jal serum free DMEM/well. 100 |li1 of 5 [iWl DRAQ5 (Biostatus) in 
Krebs buffer was added to each well and incubated for 15 minutes at 37°C. 
The plate was then placed in the imager (LEADseeker Cell Analysis System) 

20 and wells were imaged at varying time points following addition of agonist 
(50|j,l of 40 ng/ml ILIp). Translocation of the P65-GFP was observed from 
the cytoplasm to the nucleus following agonist addition. The ratio of 
nuclear/cytoplasmic fluorescence is shown in Figure 7. 



25 
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Claims 

1 . A fluorescent protein which is derived from Green Fluorescent Protein 
5 (GFP) or any functional GPP analogue and has an amino acid sequence which 

is modified by amino acid substitution compared with the amino acid 
sequence of wild type Green Fluorescent Protein said modified fluorescent 
protein comprising: 

i) an amino acid substitution at position F64; 
10 ii) a single amino acid substitution at a position selected from the group 

consisting of positions S65 and E222; and 
iii) an amino acid substitution at position S175; 
wherein said modified GFP has a different excitation spectrum and/or 
emission spectrum compared with wild type GFP. 

15 

2. A fluorescent protein according to claim 1 wherein the amino acid F at 
position 64 has been substituted by an amino acid selected from the group 
consisting of L, U V, A and G. 

20 3. A fluorescent protein according to claim 1 wherein the amino acid S at 
position 1 75 has been substituted by an amino acid selected from the group 
consisting of G, A, U I and T. 

4. A fluorescent protein according to claims 1 to 3 wherein the amino 
25 acid S at position 65 has been substituted by an amino acid selected from 

the group consisting of G, A, U C, V, I and T. 

5. A fluorescent protein according to claims 1 to 3 wherein the amino 
acid E at position 222 has been substituted by an amino acid selected from 

30 the group consisting of G, A, V, L, I, F, S, T, N and Q. 
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6. A fluorescent protein according to any of claims 1 to 5 selected from 
F64L-S175G-E222G-GFP and F64L-S65T-S1 75G-GFP. 

7. A fluorescent protein according to any of claims 1 to 6 having an 

5 amino acid sequence which is modified by amino acid substitution compared 
with the amino acid sequence of wild type Green Fluorescent Protein having 
the sequence: SEQ ID No. 2. 

8. A fluorescent protein derived from Green Fluorescent Protein (GFP) and 
10 having the amino acid sequence as set forth In SEQ ID No. 3. 

9 A fluorescent protein derived from Green Fluorescent Protein (GFP) and 
having the amino acid sequence as set forth In SEQ ID No.4. 

15 10. A fusion compound comprising a protein of interest fused to a 
fluorescent protein said fluorescent protein being a modified protein 
according to any of claims 1 to 9. 

11. A nucleic acid molecule comprising a nucleotide sequence encoding a 
20 fluorescent protein which is derived from Green Fluorescent Protein (GFP) or 
any functional GFP analogue and has an amino acid sequence which is 
modified by amino acid substitution compared with the amino acid sequence 
of wild type Green Fluorescent Protein said modified fluorescent protein 
comprising: 

25 i) an amino acid substitution at position F64; 

ii) a single amino acid substitution at a position selected from the group 
consisting of positions 865 and E222; and 

iii) an amino acid substitution at position SI 75; 

wherein said modified GFP has a different excitation spectrum and/or 
30 emission spectrum compared with wild type GFP. 
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12. A nucleic acid according to claim 1 1 encoding a fluorescent protein 
having an amino acid sequence which is modified by amino acid substitution 
compared with the amino acid sequence of wild type Green Fluorescent 
Protein having the sequence: SEQ ID No-2. 

5 

13. A nucleic acid molecule according to claims 1 1 or 12 encoding a 
fluorescent protein having an amino acid sequence selected from the group 
consisting of SEQ ID No. 3 and SEQ ID No.4. 

10 14. A nucleic acid molecule comprising a nucleotide sequence encoding a 
fusion protein comprising a protein of interest fused to a fluorescent protein 
according to any one of claims 1 to 9. 

15. An expression vector comprising suitable expression control sequences 
15 operably linked to a nucleic acid molecule according to any of claims 1 1 to 

14. 

16. A host cell transformed or transf acted with a DNA construct 
comprising an expression vector according to claim 15, 

20 

17. The host cell according to claim 16 wherein said host cell is selected 
from the group consisting of a mammalian cell, a bacterial cell, a yeast cell 
and an insect cell. 

25 18. A method for preparing a Green Fluorescent Protein (GFP) or a 
functional GFP analogue according to the present invention said method 
comprising cultivating a host according to claim 1 6 or claim 1 7 and obtaining 
therefrom the polypeptide expressed by said nucleotide sequence. 

30 19. A method of measuring the expression of a protein of interest in a cell 
which method comprises: 
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i) introducing into a cell a nucleic acid molecule comprising a nucleotide 
sequence encoding a fluorescent protein which is derived from Green 
Fluorescent Protein (GFP) or any functional GPP analogue according to any 
one of claims 1 to 9 said nucleic acid molecule being operably linked to and 

5 under the control of an expression control sequence which moderates 
expression of said protein of interest; 

ii) culturing said cell under conditions suitable for the expression of said 
protein of interest; and 

iii) detecting the fluorescence emission of said Green Fluorescent Protein 
10 (GFP) or a functional GFP analogue as a means of measuring the expression 

of said protein of interest. 

20. A method of determining the cellular and/or extracellular localisation of 
a protein of interest which method comprises: 

15 i) introducing into a ceil a nucleic acid molecule comprising a nucleotide 
sequence encoding a fluorescent protein which is derived from Green 
Fluorescent Protein (GFP) or any functional GFP analogue according to any 
one of claims 1 to 9 fused to a nucleotide sequence encoding a protein of 
interest, said nucleic acid molecule being operably linked to and under the 

20 control of a suitable expression control sequence; 

ii) culturing said cell under conditions suitable for the expression of said 
protein of interest; and 

iii) determining the cellular and/or extracellular localisation of said protein 
of interest by detecting the fluorescence emission by optical means. 

25 

21 . A method of comparing the effect of one or more test substance{s) on 
the expression and/or localisation of one or more different protein{s) of 
interest in a cell which method comprises: 

i) introducing into a cell: 
30 a) a nucleic acid molecule comprising a nucleotide sequence encoding a 
Green Fluorescent Protein (GFP) or a functional GFP analogue according to 
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any one of claims 1 to 9 optionally fused to a nucleotide sequence encoding 
a first protein of interest, said nucleic acid molecule being operably linked to 
and under the control of a first expression control sequence; and optionally, 
b) at least one different nucleic acid molecule encoding a protein reporter 

5 molecule optionally fused to a different protein of interest, each said nucleic 
acid molecule being operably linked to and under the control of a second 
expression control sequence wherein said protein reporter molecule has or is 
capable of generating an emission signal which is spectrally distinct from that 
of said Green Fluorescent Protein (GFP) or functional GPP analogue; 

10 ii) culturing said cells under conditions suitable for the expression of said 
protein(s) of interest in the presence and absence of said test substance(s); 
iii) determining the expression and/or localisation of said protein(s) of 
interest in said cells by detecting the fluorescence emission by optical means; 
and 

15 iv) comparing the fluorescence emission obtained in the presence and 
absence of said test substance{s) to determine the effect of said test 
substance(s) on the expression and/or localisation of said protein{s) of 
interest. 

20 22. The method according to claim 21 wherein samples of said cells in a 
fluid medium are introduced into separate vessels for each of said test 
substances to be studied. 
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Nucleotide Sequence of wtGFP (Chalfie et ai. Science, (1994), 263. 802-5): 

SEQ ID No,1 

atg agt aaa gga gaa gaa ctt ttc act gga gtt gtc cca att ctt gtt 48 

gaa tta gat ggt gat gtt aat ggg cac aaa ttt tct gtc agt gga gag 96 

99t gaa ggt gat gca aca tac gga aaa ctt acc ctt aaa ttt att tgc 144 

act act gga aaa eta act gtt cca tgg cca aca ctt gtc act act ttc 192 

tct tat ggt gtt caa tgc ttt tea aga tac cca gat cat atg aaa egg 240 

cat gac ttt ttc aag agt gcc atg ccc gaa ggt tat gta cag gaa aga 2 88 

act at a ttt ttc aaa gat gac ggg aac tac aag aca cgt get gaa gtc 336 

aag ttt gaa ggt gat acc ctt gtt aat aga ate gag tta aaa ggt att 384 

gat ttt aaa gaa gat gga aac att ctt gga cac aaa ttg gaa tac aac 432 

tat aac tea cac aat gta tac ate atg gca gac aaa caa aag aat gga 480 

ate aaa gtt aac ttc aaa att aga cac aac att gaa gat gga age gtt 528 

caa eta gca gac eat tat caa caa aat act cca att ggc gat ggc cct 576 

gtc ctt tta cca gac aac cat tac ctg tec aca caa tct gcc ctt teg 624 

aaa gat eee aac gaa aag aga gac cac atg gtc ctt ctt gag ttt gta 672 

aca get get ggg att aca cat ggc atg gat gaa eta tac aaa tag 717 
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Amino Acid Sequence of wtGFP (Chalfie et aL Science, (1994), 263, 802-5 

SEQ ID No.2 



Met Ser Lys Gly 
1 

Glu Leu Asp Gly 

20 

Gly Glu Gly Asp 
35 

Thr Thr Gly Lys 
50 

Ser Tyr Gly Val 
65 

His Asp Phe Pile 

Thr lie Phe Phe 

100 

Lys Plie Glu Gly 
115 

Asp Phe Lys Glu 
130 

Tyr Asn Ser His 
145 

lie Lys Val Asn 

Gin Leu Ala Asp 

180 

Val Leu Leu Pro 
195 

Lys Asp Pro Asn 
210 

Thr Ala Ala Gly 
225 



Glu Glu Leu Phe 
5 

Asp Val Asn Gly 

Ala Thr Tyr Gly 

40 

Leu Pro Val Pro 
55 

Gin Cys Phe Ser 
70 

Lys Ser Ala Met 
85 

Lys Asp Asp Gly 

Asp Thr Leu Val 

120 

Asp Gly Asn lie 
135 

Asn Val Tyr lie 
150 

Phe Lys lie Arg 
165 

His Tyr Gin Gin 

Asp Asn His Tyr 

200 

Glu Lys Arg Asp 
215 

lie Thr His Gly 
230 



Thr Gly Val Val 
10 

His Lys Phe Ser 
25 

Lys Leu Thr Leu 



Trp Pro Thr Leu 

60 

Arg Tyr Pro Asp 
75 

Pro Glu Gly Tyr 
90 

Asn Tyr Lys Thr 
105 

Asn Arg lie Glu 



Leu Gly His Lys 

140 

Met Ala Asp Lys 
155 

His Asn lie Glu 
170 

Asn Thr Pro lie 

185 

Leu Ser Thr Gin 



His Met Val Leu 

220 

Met Asp Glu Leu 
235 



Pro lie Leu Val 
15 

Val Ser Gly Glu 
30 

Lys Phe lie Cys 
45 

Val Thr Thr Phe 



His Met Lys Arg 

80 

Val Gin Glu Arg 
95 

Arg Ala Glu Val 
110 

Leu Lys Gly lie 
125 

Leu Glu Tyr Asn 



Gin Lys Asn Gly 

160 

Asp Gly Ser Val 
175 

Gly Asp Gly Pro 
190 

Ser Ala Leu Ser 
205 

Leu Glu Phe Val 



Tyr Lys 
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Predicted Amino Acid Sequence of F64L-S1 75G-E222G-GFP: 

SEQ ID No.3 



Met Ser Lys "Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val 
1 5 10 . 15 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 

20 25 30 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys 
35 40 45 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu 
50 55 . 60 

Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Arg 
65 70 75 80 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 

85 90 95 

Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 

100 105 110 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly lie 
115 120 125 

Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 

Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn Gly 
145 150 155 160 

lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Gly Val 

165 170 175 

Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly Pro 

180 185 190 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 
195 200 205 

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Gly Phe Val 
210 215 220 

Thr Ala Ala Gly lie Thr His Gly Met Asp Glu Leu Tyr Lys 

225 230 235 
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Predicted Amino Acid Sequence of F64L-S65T-S1 75G-GFP: 

SEQ ID No.4 



Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val 
15 10 15 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 

20 25 30 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys 
35 40 45 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu 
50 55 60 

Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Arg 
65 70 75 80 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 

85 90 95 

Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 

100 105 110 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly lie 
115 120 125 

Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 

Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly 
145 150 155 160 

He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Gly Val 

165 170 175 

Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro 

180 185 190 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 
195 200 205 

Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
210 215 220 

Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys 
225 230 235 
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Flow cytometry of GFP mutations 



F64L:S175G:E222G 



F64L:V163A:E222G 



F64L:S65T;V163A 



F64L:S65T:S175G 



F64L:E222G 



V163A:S175G 



F64L:V163A 



F64L 



V163A:S175G:E222G 



VI 63 A 



F64L:S175G 



S175G 



CO 

c 
o 

E 
O 



V163A:E222G 



S175G:E222G 



E222G 



wtGFP 




0 



10 20 30 40 50 

Fluorescence (geometric mean) 



60 
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Photobleaching of GFP mutations 




0.5 
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0 
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Live Cell CHO-HIR NFkB Assay 
P65-tri GFP Assay T = 30 mins 



1.7 




unstimulated stimulated 



mean 
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<110> Amersham Pharmacia Biotech UK Ltd 

<120> Fluorescent Proteins 

<130> GFP Mutant 

<140> 
<141> 

<150> GBOl/09858.1 
<151> 2001-04-23 

<160> 19 

<170> Patentin Ver. 2.1 

<210> 1 
<211> 717 
<212> DNA 

<213> Aequorea victoria 
<300> 

<301> Chalfie, 
<303> Science 
<304> 263 
<306> 802-805 
<307> 1994 

<400> 1 

atgagtaaag gagaagaact tttcactgga gttgtcccaa ttcttgttga attagatggt 60 
gatgttaatg ggcacaaatt ttctgtcagt ggagagggtg aaggtgatgc aacatacgga 120 
aaacttaccc ttaaatttat ttgcactact ggaaaactac ctgttccatg gccaacactt 180 
gtcactactt tctcttatgg tgttcaatgc ttttcaagat acccagatca tatgaaacgg 240 
catgactttt tcaagagtgc catgcccgaa ggttatgtac aggaaagaac tatatttttc 300 
aaagatgacg ggaactacaa gacacgtgct gaagtcaagt ttgaaggtga tacccttgtt 360 
aatagaatcg agttaaaagg tattgatttt aaagaagatg gaaacattct tggacacaaa 420 
ttggaataca actataactc acacaatgta tacatcatgg cagacaaaca aaagaatgga 480 
atcaaagtta acttcaaaat tagacacaac attgaagatg gaagcgttca actagcagac 540 
cattatcaac aaaatactcc aattggcgat ggccctgtcc ttttaccaga caaccattac 600 
ctgtccacac aatctgccct ttcgaaagat cccaacgaaa agagagacca catggtcctt 660 
cttgagtttg taacagctgc tgggattaca catggcatgg atgaactata caaatag 717 

<210> 2 
<211> 238 
<212> PRT 



1 
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<300> 

<301> Chalfie, 
<303> Science 
<304> 263 
<306> 802-805 
<307> 1994 

<400> 2 

Met Ser Lys Gly 
1 

Glu Leu Asp Gly 

20 

Gly Glu Gly Asp 

35 

Thr Thr Gly Lys 
50 

Ser Tyr Gly Val 
65 

His Asp Phe Phe 



Thx lie Phe Phe 

100 

Lys Phe Glu Gly 
115 

Asp Phe Lys Glu 
130 

Tyr Asn Ser His 

145 

lie Lys Val Asn 



Gin Leu Ala Asp 

180 

Val Leu Leu Pro 
195 



Glu Glu Leu Phe 
5 

Asp Val Asn Gly 



Ala Thr Tyr Gly 

40 

Leu Pro Val Pro 

55 

Gin Cys Phe Ser 
70 

Lys Ser Ala Met 
85 

Lys Asp Asp Gly 



Asp Thr Leu Val 

120 

Asp Gly Asn lie 
135 

Asn Val Tyr lie 
150 

Phe Lys lie Arg 
165 

His Tyr Gin Gin 



Asp Asn His Tyr 

200 



Thr Gly Val Val 
10 

His Lys Phe Ser 
25 

Lys Leu Thr Leu 



Trp Pro Thr Leu 

60 

Arg Tyr Pro Asp 

75 

Pro Glu Gly Tyr 
90 

Asn Tyr Lys Thr 
105 

Asn Arg lie Glu 



Leu Gly His Lys 

140 

Met Ala Asp Lys 
155 

His Asn lie Glu 
170 

Asn Thr Pro lie 
185 

Leu Ser Thr Gin 



Pro lie Leu Val 

15 

Val Ser Gly Glu 
30 

Lys Phe lie Cys 
45 

Val Thr Thr Phe 



His Met Lys Arg 

80 

Val Gin Glu Arg 

95 

Arg Ala Glu Val 
110 

Leu Lys Gly lie 
125 

Leu Glu Tyr Asn 



Gin Lys Asn Gly 

160 

Asp Gly Ser Val 
175 

Gly Asp Gly Pro 
190 

Ser Ala Leu Ser 
205 



2 
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Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 

210 215 220 

Thr Ala Ala Gly lie Thr His Gly Met Asp Glu Leu Tyr Lys 
225 230 235 



<210> 3 
<211> 238 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Secjuence: synthetic 
protein 

<400> 3 

Met Ser Lys Gly Glu Glu Leu Ph.e Th.r Gly Val Val Pro lie Leu Val 
15 10 15 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 

20 25 30 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys 

35 40 45 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu 
50 55 60 

Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Arg 
65 70 75 80 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 

85 90 95 

Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 

100 105 110 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly lie 
115 120 125 

Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 

Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn Gly 
145 150 155 160 



3 
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lie Lys Val Asn Phe Lys 

165 

Gin Leu Ala Asp His Tyr 

180 

Val Leu Leu Pro Asp Asn 
195 

Lys Asp Pro Asn Glu Lys 
210 

Thr Ala Ala Gly lie Thr 
225 230 



lie Arg His Asn He Glu 

170 

Gin Gin Asn Thr Pro He 
185 

His Tyr Leu Ser Tlir Gin 
200 

Arg Asp His Met Val Leu 
215 220 

His Gly Met Asp Glu Leu 

235 



PCT/GBOl/04363 

Asp Gly Gly Val 
175 

Gly Asp Gly Pro 
190 

Ser Ala Leu Ser 
205 

Leu Gly Phe Val 



Tyr Lys 



<210> 4 
<211> 238 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : synthetic 
protein 

<400> 4 

Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val 
15 10 15 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 

20 25 30 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys 

35 40 45 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu 

50 55 60 

Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Arg 
65 70 75 80 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 

85 90 95 

Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 

100 105 110 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He 



4 
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125 



Asp Phe Lys Glu Asp 
130 

Tyr Asn Ser His Asn 
145 

lie Lys Val Asn Phe 

165 

Gin Leu Ala Asp His 

180 

Val Leu Leu Pro Asp 
195 

Lys Asp Pro Asn Glu 

210 

Thr Ala Ala Gly lie 
225 



Gly Asn lie Leu Gly His 
135 

Val Tyr lie Met Ala Asp 
150 155 

Lys lie Arg His Asn lie 

170 

Tyr Gin Gin Asn Thr Pro 

185 

Asn His Tyr Leu Ser Thr 
200 

Lys Ar,g Asp His Met Val 
215 

Thr His Gly Met Asp Glu 
230 235 



Lys Leu Glu Tyr Asn 
140 

Lys Gin Lys Asn Gly 

160 

Glu Asp Gly Gly Val 

175 

lie Gly Asp Gly Pro 
190 

Gin Ser Ala Leu Ser 
205 

Leu Leu Glu Phe Val 
220 

Leu Tyr Lys 



<210> 5 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Seqpaence: synthetic 
oligonucleotide 

<400> 5 

ggtacgggcc gccaccatga gtaaaggaga agaacttttc ac 42 



<210> 6 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : synthetic 
oligonucleotide 

<400> 6 

ggtacgggtt aaccggtttt gtatagttca tccatg 36 



5 
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<210> 7 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : synthetic 
oligonucleotide 

<400> 7 

ggtacgggcc gccaccatgg gatccaaagg agaagaactt ttcac 

<210> 8 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : synthetic 
oligonucleotide 

<400> 8 

ccaacacttg tcactactct ctcttatggt gttcaat 

<210> 9 
<211> 37 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : synthetic 
oligonucleotide 

<400> 9 

attgaacacc ataagagaga gtagtgacaa gtgttgg 

<210> 10 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : synthetic 



6 
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<400> 10 

ccaacacttg tcactactct cacctatggt gttcaatgct tttca 45 

<210> 11 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : synthetic 
oligonucleotide 

<300> 

<301> Chalfie, 
<303> Science 
<304> 263 
<306> 802-805 
<307> 1994 

<400> 11 

tgaaaagcat tgaacaccat aggtgagagt agtgacaagt gttgg 45 

<210> 12 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : synthetic 
oligonucleotide 

<400> 12 

gacaaacaaa agaatggaat caaagccaac ttcaaaatta gacac 45 

<210> 13 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : synthetic 
oligonucleotide 



7 
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<400> 13 



gtgtctaatt ttgaagttgg ctttgattcc attcttttgt ttgtc 



45 



<210> 14 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Seqpaence : synthetic 
oligonucleotide 

<400> 14 

caacattgaa gatggaggcg ttcaactagc agacc 35 

<210> 15 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
oligonucleotide 



<210> 16 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : synthetic 
oligonucleotide 

<400> 16 

ccacatggtc cttcttggct ttgtaacagc tgctgg 36 

<210> 17 
<211> 36 
<212> DNA 

<213> Artificial Sequence 



<400> 15 



ggtctgctag ttgaacgcct ccatcttcaa tgttg 



35 



8 
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<220> 

<223> Description of Artificial Sequence : synthetic 
oligonucleotide 

<400> 17 

ccagcagctg ttacaaagcc aagaaggacc atgtgg 

<210> 18 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : synthetic 
oligonucleotide 

<400> 18 

ttttactcga gatggacgaa ctgttccccc tea 

<210> 19 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : synthetic 
oligonucleotide 

<400> 19 

ttttgaagct tggagctgat ctgactcagc agg 
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